SEQUENCE LISTING

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Bandman, Olga

6 Corley, Neil C.

7 Shah, Purvi 

9 (ii) TITLE OF INVENTION: HUMAN MITOCHONDRIAL MALATE DEHYDROGENASE 

11 (iii) NUMBER OF SEQUENCES: 4 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

15 (B) STREET: 3174 Porter Drive 

16 (C) CITY: Palo Alto 

17 (D) STATE: CA 

18 (E) COUNTRY: USA 

19 (F) ZIP: 94304 

21 (v) COMPUTER READABLE FORM:

22 (A) MEDIUM TYPE: Diskette 

23 (B) COMPUTER: IBM Compatible 

24 (C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
27 (vi) CURRENT APPLICATION DATA: 

C--> 28 (A) APPLICATION NUMBER: US/09/915,694 

C--> 29 (B) FILING DATE: 25-Jul-2001 

30 (C) CLASSIFICATION: 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 08/922,957 

34 (B) FILING DATE: 

36 (viii) ATTORNEY/AGENT INFORMATION: 

37 (A) NAME: Billings, Lucy J. 

38 (B) REGISTRATION NUMBER: 36,749 

39 (C) REFERENCE/DOCKET NUMBER: PF-0379 US 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 650-855-0555 

43 (B) TELEFAX: 650-845-4166 

44 (C) TELEX: 
47 (2) INFORMATION FOR SEQ ID NO: 1:

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 338 amino acids 

51 (B) TYPE: amino acid 

52 (C) STRANDEDNESS: single

53 (D) TOPOLOGY: linear 

55 (vii) IMMEDIATE SOURCE: 

56 (A) LIBRARY: THP1PLB01

57 (B) CLONE: 11587 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

61 Met Leu Ser Ala Leu Ala Arg Pro Ala Ser Ala Ala Leu Arg Arg Ser 

62 1 5 10 15 

63 Phe Ser Thr Ser Ala Gln Asn Asn Ala Lys Val Ala Val Leu Gly Ala
101 Pro Glu Leu Lys Ala Ser Ile Lys Lys Gly Glu Asp Phe Val Lys Thr

102 325 330 335 

103 Leu Lys 

106 (2) INFORMATION FOR SEQ ID NO: 2: 

108 (i) SEQUENCE CHARACTERISTICS: 

109 (A) LENGTH: 1334 base pairs 

110 (B) TYPE: nucleic acid 

111 (C) STRANDEDNESS: single

112 (D) TOPOLOGY: linear 

114 (vii) IMMEDIATE SOURCE: 

115 (A) LIBRARY: THP1PLB01

116 (B) CLONE: 11587 
118 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

120 GGCCCCAGAG AGCAGGCGCT GGGCAGTGTG GAGGTCGTTG GAGTCACTTC CGCGTCACCA 60 

121 GCTCCTGTGC CTGCCAGTCG GTGCCCCTCC CGCTCCAGCC ATGCTCTCCG CCCTCGCCCG 120 

122 GCCTGCCAGC GCTGCTCTCC GCCGCAGCTT CAGCACCTCG GCCCAGAACA ATGCTAAAGT 180 

123 AGCTGTGCTA GGGGCCTCTG GAGGCATCGG GCAGCCACTT TCACTTCTCC TGAAGAACAG 240 

124 CCCCTTGGTG AGCCGCCTGA CCCTCTATGA TATCGCGCAC ACACCCGGAG TGGCCGCAGA 300 

125 TCTGAGCCAC ATCGAGACCA AAGCCGCTGT GAAAGGCTAC CTCGGACCTG AACAGCTGCC 360 

126 TGACTGCCTG AAAGGTTGTG ATGTGGTAGT TATTCCGGCT GGAGTCCCCA GAAAGCCAGG 420 

127 CATGACCCGG GACGACCTGT TCAACACCAA TGCCACGATT GTGGCCACCC TGACCGCTGC 480 

128 CTGTGCCCAG CACTGCCCGG AAGCCATGAT CTGCGTCATT GCCAATCCGG TTAATTCCAC 540 

129 CATCCCCATC ACAGCAGAAG TTTTCAAGAA GCATGGAGTG TACAACCCCA ACAAAATCTT 600 

130 CGGCGTGACG ACCCTGGACA TCGTCAGAGC CAACACCTTT GTTGCAGAGC TGAAGGGTTT 660 

131 GGATCCAGCT CGAGTCAACG TCCCTGTCAT TGGTGGCCAT GCTGGGAAGA CCATCATCCC 720 

132 CCTGATCTCT CAGTGCACCC CCAAGGTGGA CTTTCCCCAG GACCAGCTGA CAGCACTCAC 780 

133 TGGGCGGATC CAGGAGGCCG GCACGGAGGT GGTCAAGGCT AAAGCCGGAG CAGGCTCTGC 840 

134 CACCCTCTCC ATGGCGTATG CCGGCGCCCG CTTTGTCTTC TCCCTTGTGG ATGCAATGAA 900 

135 TGGAAAGGAA GGTGTTGTGG AATGTTCCTT CGTTAAGTCA CAGGAAACGG AATGTACCTA 960 

136 CTTCTCCACA CCGCTGCTGC TTGGGAAAAA GGGCATCGAG AAGAACCTGG GCATCGGCAA 1020 

137 AGTCTCCTCT TTTGAGGAGA AGATGATCTC GGATGCCATC CCCGAGCTGA AGGCCTCCAT 1080 

138 CAAGAAGGGG GAAGATTTCG TGAAGACCCT GAAGTGAGCC GCTGTGACGG GTGGCCAGTT 1140

139 TCCTTAATTT ATGAAGGCAT CATGTCACTG CAAAGCCGTT GCAGATAAAC TTTGTATTTT 1200 

140 AATTTGCTTT GGTGATGATT ACTGTATTGA CATCATCATG CCTTCCAAAT TGTGGGTGGC 1260 

141 TCTGTGGGCG CATCAATAAA AGCCGTCCTT GATTTTATTT TTCAAGGTCC CTTCTGTAAA 1320 

142 TGCAAAAAAA AAAA 1334
144 (2) INFORMATION FOR SEQ ID NO: 3: 

146 (i) SEQUENCE CHARACTERISTICS:

147 (A) LENGTH: 338 amino acids 

148 (B) TYPE: amino acid 

149 (C) STRANDEDNESS: single

150 (D) TOPOLOGY: linear 

152 (vii) IMMEDIATE SOURCE: 

153 (A) LIBRARY: GenBank 

154 (B) CLONE: 56643 

156 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

158 Met Leu Ser Ala Leu Ala Arg Pro Val Gly Ala Ala Leu Arg Arg Ser 

159 1 5 10 15

160 Phe Ser Thr Ser Ala Gln Asn Asn Ala Lys Val Ala Val Leu Gly Ala

161 20 25 30

162 Ser Gly Gly Ile Gly Gln Pro Leu Ser Leu Leu Leu Lys Asn Ser Pro

163 35 40 45

164 Leu Val Ser Arg Leu Thr Leu Tyr Asp Ile Ala His Thr Pro Gly Val

165 50 55 60

166 Ala Ala Asp Leu Ser His Ile Glu Thr Arg Ala Asn Val Lys Gly Tyr

167 65 70 75 80

168 Leu Gly Pro Glu Gln Leu Pro Asp Cys Leu Lys Gly Cys Asp Val Val

169 85 90 95

170 Val Ile Pro Ala Gly Val Pro Arg Lys Pro Gly Met Thr Arg Asp Asp

171 100 105 110

172 Leu Phe Asn Thr Asn Ala Thr Ile Val Ala Thr Leu Thr Ala Ala Cys
229 Ile Cys Ile Ile Ser Asn Pro Val Asn Ser Thr Ile Pro Met Thr Ala

230 100 105 110

231 Glu Val Phe Lys Lys His Gly Val Tyr Asn Pro Asn Lys Ile Phe Gly

232 115 120 125

233 Val Thr Thr Leu Asp Ile Val Arg Ala Asn Ala Phe Val Ala Glu Leu

234 130 135 140

235 Lys Gly Leu Asp Pro Ala Arg Val Ser Val Pro Val Ile Gly Gly His

236 145 150 155 160

237 Ala Gly Lys Thr Ile Ile Pro Leu Ile Ser Gln Cys Thr Pro Lys Val

238 165 170 175

239 Asp Phe Pro Gln Asp Gln Leu Ser Thr Leu Thr Gly Arg Ile Gln Glu

240 180 185 190 

241 Ala Gly Thr Glu Val Val Lys Ala Lys Ala Gly Ala Gly Ser Ala Thr 

242 195 200 205 

243 Leu Ser Met Ala Tyr Ala Gly Ala Arg Phe Val Phe Ser Leu Val Asp 

244 210 215 220 

245 Ala Met Asn Gly Lys Glu Gly Val Val Glu Cys Ser Phe Val Lys Ser 

246 225 230 235 240 

247 Gln Glu Thr Asp Cys Pro Tyr Phe Ser Thr Pro Leu Leu Leu Gly Lys

248 245 250 255

249 Lys Gly Ile Glu Lys Asn Leu Arg Ile Gly Lys Ile Ser Pro Phe Glu

250 260 265 270

251 Glu Lys Met Ile Ala Glu Ala Ile Pro Glu Leu Lys Ala Ser Ile Lys

252 275 280 285 

253 Lys Gly Glu Glu Phe Val Lys Asn Thr Lys 

254 290 295 